(57) Abstract: A method for estimating a receiver's location (X) in a wireless communication environment (RN) having several
channels. Each channel has at least one signal parameter (V) that varies with location (X) differently from the other channels. A
set of calibration data (CD) is determined for each calibration point, each set comprising the location (X) and at least one measured
signal parameter (V) for each of several channels. The calibration data (CD) serve as a basis for a statistical model (SM) of the signal
parameters (V) versus a receiver's location. A set of observed signal parameters (CO) is determined, the set comprising at least one
signal parameter (V) for each of several channels at the receiver's location (X). A location estimate (LE) approximating the location
(X) of the receiver (R) is determined on the basis of the statistical model (SM) and the set of observed signal parameters (CO).

Location estimation in wireless telecommunication networks

Background of the invention 

The invention relates to methods and equipment for estimating a
receiver's location in a wireless telecommunication environment, i.e. one
or more networks which may be radio, microwave or optical networks. The
one or more networks communicate on a plurality of channels
simultaneously. Such location estimation can be used to provide a wide
variety of location-dependent services.

US patent 6 112 095 to Mati Wax et al. discloses a method for providing a
set of likely locations of a transmitter in a cellular network, such as
AMPS or CDMA. A problem with the technique disclosed in the Wax patent is
that it requires additional hardware at the network side, such as an
antenna array which is equipped to measure an angular direction relative
to a base station. In other words, to determine a mobile station's
location, information on the network infrastructure must be available and
the mobile station must transmit something to have its location estimated.

Disclosure of the invention 

An object of the invention is to solve the above problems. In other
words, the mechanism according to the invention should be able to estimate
a receiver's location in a wireless telecommunication network even without
prior knowledge of the network infrastructure (such as the locations of
the base stations).

This object is achieved with a method and equipment which are
characterized by what is disclosed in the attached independent claims.
Preferred embodiments of the invention are disclosed in the attached
dependent claims.

The invention is based on the surprising idea that it is possible to
estimate a receiver's location with reasonable confidence without
knowledge of the infrastructure of the receiver's wireless environment,
i.e. the network(s) received by the receiver. For example, the technique
disclosed in the above-referenced Wax patent relies on the cellular
network's base station configuration, including the location of the base
stations. It is indeed surprising that the technique according to the
invention is feasible. The fact that it is surprising is apparent as soon
as one walks around with a mobile phone having a field strength indicator.
In some places, a shift of 20 to 30 cm changes the field strength
dramatically. Evidently, there must be a vast number of locations with
near-identical field strength. One would expect that calibrating a
location estimation system requires field strength (or other signal
parameter) measurements at locations very close to each other, and that
huge databases would be required to store these measurements. Atmospheric
conditions, cityscapes and network configurations change continuously. At
first sight, it would seem that the databases will deteriorate rapidly,
unless constantly updated. However, computer simulations show that a
technique based on measurements at several channels (frequencies) is
surprisingly robust. Also, calibration data can be collected automatically
under various conditions.

One aspect of the invention is a method for estimating a location of a
receiver in a wireless telecommunication environment, the
telecommunication environment comprising a plurality of channels for
simultaneous communication, each channel having at least one signal
parameter that varies with location differently from the other channels.
The method can be implemented by the following steps:

1) for each of a plurality of calibration points in the wireless
telecommunication environment, determining a set of calibration data, each
set of calibration data comprising the location of the respective
calibration point and at least one measured signal parameter for each of
several channels at that calibration point;

2) on the basis of the sets of calibration data, maintaining a statistical
model of the signal parameters of the several channels versus a receiver's
location in the wireless telecommunication network;

3) measuring at least one signal parameter for each of several channels at
the receiver; and

4) estimating the location of the receiver on the basis of the statistical
model and the measured signal parameters of the several channels at the
receiver.
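
The four steps above can be sketched in a few lines of Python. This is only
an illustration under stated assumptions: the calibration values, the
location labels "A" and "B", the channel names and the function names are
hypothetical, and the per-location mean with a nearest-profile rule is a
deliberately simple stand-in for the statistical model, not the model the
invention prescribes.

```python
from collections import defaultdict

# Step 1 (hypothetical data): each calibration record pairs a location
# with one measured signal parameter per channel.
CALIBRATION = [
    ("A", {"ch1": 10, "ch2": 40}),
    ("A", {"ch1": 12, "ch2": 38}),
    ("B", {"ch1": 30, "ch2": 20}),
    ("B", {"ch1": 28, "ch2": 22}),
]


def build_model(calibration):
    """Step 2: summarise the calibration data as per-location channel means."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = defaultdict(int)
    for location, params in calibration:
        counts[location] += 1
        for channel, value in params.items():
            sums[location][channel] += value
    return {
        loc: {ch: total / counts[loc] for ch, total in channels.items()}
        for loc, channels in sums.items()
    }


def estimate_location(model, observed):
    """Step 4: return the location whose mean profile is closest to `observed`."""
    def squared_distance(profile):
        return sum(
            (profile[ch] - observed[ch]) ** 2 for ch in observed if ch in profile
        )
    return min(model, key=lambda loc: squared_distance(model[loc]))


model = build_model(CALIBRATION)
# Step 3: the receiver's own observations (assumed values).
estimate = estimate_location(model, {"ch1": 11, "ch2": 41})
```

Note that `estimate_location` consults only the model, not the individual
calibration records, which is the property the term 'statistical model' is
meant to capture.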

Another aspect of the invention is an arrangement for carrying out the
above method. The arrangement can be embodied as a receiver comprising
means for determining sets of observed signal parameters, each set
comprising at least one observed signal parameter for each of several
channels at the location of the receiver. The receiver may itself comprise
a location calculation module for determining a location estimate
approximating the location of the receiver on the basis of said sets and a
statistical model of the signal parameters of the several channels versus
a receiver's location in a wireless telecommunication environment.
Alternatively, the receiver may convey the sets to an external location
calculation module.

The term 'receiver' means that the device whose location is being
estimated does not have to transmit when its location is being estimated.
In other words, it suffices that the device is making observations of its
wireless environment. For example, a GSM phone does not have to receive a
traffic channel. Rather it makes observations at all available
frequencies. The device may also have, and typically has, transmitting
ability, but it is not necessary for all embodiments of the invention, and
the invention can be used to estimate the location of a pager or a
broadcast receiver. Because transmission capability is not essential to
location estimation according to the invention, the receiver may exploit
signal parameters of networks it is not attached to. For example, a GSM
phone attached to one GSM network may exploit signal strength values of
other GSM networks.

The term 'environment' means that the receiver can receive (make
observations of) at least one network, but it can receive more than one.
For example, a GSM phone may observe several operators' GSM networks. A
more advanced receiver may observe many types of networks, such as
cellular networks and broadcast networks.

A 'wireless' environment means that the one or more networks may be radio,
microwave or optical networks. Also, the set of networks received by the
receiver must communicate on a plurality of channels simultaneously, and
the plurality of channels must comprise a subset of channels such that
each channel in the subset has at least one signal parameter that varies
with location differently from the other channels in the subset. This
means that several channels having signal parameters with near-identical
dependence on location, such as channels from a common transmitting
antenna, do not normally give sufficient information for reliable location
estimation. Normally, signals from at least three transmitting stations
are required. Examples of suitable networks are cellular networks (such as
GSM, GPRS, UMTS, etc.), broadcast networks (analogue audio, DAB or DVB),
wireless local-area networks (WLAN) or short-range microwave networks,
such as Bluetooth.

A 'location' may have one to three dimensions. A one-dimensional
presentation of location may be sufficient in trains and the like. Two- or
three-dimensional presentations of location are much more useful, however.
In a two-dimensional presentation, the receiver is assumed to be
substantially at earth level. Actually the height does not matter as long
as the calibration data is measured at the same height (such as ground
level, 13th floor, etc.) as the actual observations. Additionally, the
calibration data may comprise a presentation of time. This takes into
account that the wireless environment, i.e. its signal parameters, may
vary with time. In other words, the calibration data comprises, in
addition to the signal parameters, one to three location coordinates and,
optionally, a presentation of time.

The term 'calibration data', as used herein, comprises calibration
measurements (i.e. measured signal values) and the location (and,
optionally, time) at which the measurements were made.

The term 'statistical model' means that the individual sets of calibration
data are not needed to calculate an individual receiver's location. The
difference between a statistical model and the sets of calibration data
can be illustrated by the following example. Assume that we have a number
of {x, y} pairs such that there is some dependence between x and y. The y
value at a location x can be calculated on the basis of all the {x, y}
pairs. A much faster way to predict the value of y given a value of x is
to calculate a mathematical function y = f(x). In this example, the
function f is the statistical model. In other words, the value of y given
a value of x is calculated without reference to the individual {x, y}
pairs. Location estimation on the basis of the statistical model is faster
and requires less storage space than location estimation on the basis of
the individual sets of calibration data.
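
The {x, y} example can be made concrete with a short sketch. The data
points and the choice of a straight line fitted by least squares are
assumptions made purely for illustration; the text above does not say what
form f takes.

```python
# Hypothetical {x, y} pairs with a linear dependence between x and y.
pairs = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]


def fit_line(points):
    """Least-squares fit of y = a*x + b; (a, b) is the whole 'model'."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    a = sxy / sxx
    return a, mean_y - a * mean_x


a, b = fit_line(pairs)


def f(x):
    """Predict y from x using only the two fitted parameters."""
    return a * x + b
```

Once `a` and `b` are known, `pairs` can be discarded: prediction needs two
numbers rather than the full set of pairs, which mirrors the speed and
storage argument made above.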

The statistical model can have a large variety of different
implementations, such as probabilistic models, neural networks,
fuzzy-logic systems, kernel estimators, support vector machines, decision
trees, regression trees, Kalman filters and other statistical filtering
methods, wavelets, splines, inductive logic programming methods, finite
mixture models, hidden Markov models, etc. As used in this context, the
term 'statistical model' may also refer to a mixture of several
statistical (sub)models.

The term 'channel' should have a wide interpretation, meaning more or less
the same as a frequency or frequency band. The receiver does not have to
communicate on the channel, as long as the receiver (or an attached
measuring apparatus) can measure at least one signal parameter of that
channel. In TDMA systems, each frequency has several timeslots, each of
which carries one channel. As far as the invention is concerned, all
timeslots having the same frequency give identical information, and any
one of them can be used as a 'channel'. If the measured signal parameter
is signal strength, the receiver does not even have to be able to
interpret the contents of the channel.

An illustrative but non-exhaustive list of the signal parameters varying
with location comprises signal strength, timing advance and error ratio.
The list may also comprise the availability of certain channels, but this
can be seen as a special case in which the signal strength and/or error
ratio is quantified to a yes/no question. If directional antennas are
used, the direction of the radio beam(s) can be used as well. Thus the
measured signal parameters do not have to correspond to a certain channel,
but they can be derived values. For example, a measured parameter set may
be or comprise a vector V = [V1, V2, V3 ...] in which V1, V2 etc. are the
indices of the best, second best, etc. available channel. For the purposes
of clarity, however, we will use examples in which the signal parameters
are related to certain channels.
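
As a hedged illustration of such a derived parameter set, the following
sketch ranks channels by signal strength and returns the indices of the
best ones. The strength values, the function name and the zero-based
indexing are assumptions, not part of the original text.

```python
def best_channel_indices(strengths, top=3):
    """Return the indices of the `top` strongest channels, best first."""
    ranked = sorted(
        range(len(strengths)), key=lambda i: strengths[i], reverse=True
    )
    return ranked[:top]


# Hypothetical signal strengths for channels 0..4.
V = best_channel_indices([12, 47, 5, 33, 40])
```

Here `V` holds channel indices rather than measured values, so it is a
derived parameter in the sense described above.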

Each set of calibration data comprises the location of the respective
calibration point and at least one measured signal parameter for each of
several channels at that calibration point. Calibration points are points
whose location and signal parameters are known or measured. The
calibration measurements are typically determined by means of fixed and/or
mobile calibration receivers. Fixed calibration receivers can be attached
to buildings, traffic signs, lampposts and the like. Mobile calibration
receivers can be transported with persons or in vehicles. The calibration
receivers measure the signal parameters like the actual receivers do. The
measured signal parameters can be transferred to the statistical model via
wired or wireless transmission (= on-line) or by moving a detachable
medium, such as a memory disk, tape or card (= off-line).

Location estimation can take place at the receiver site or at the network
site. If the location is estimated at the receiver site, the receiver (or
an attached computer) must have access to the statistical model. With
current technology, a feasible statistical model can be compressed to a
size which is manageable in a laptop or palmtop computer. The model can be
updated while the computer is connected to the Internet, for example.
Alternatively, the model can be supplied on a detachable memory, such as a
CD-ROM or DVD-ROM. In the future, even a mobile phone will have sufficient
memory for holding the statistical model. The model can be updated by
means of a data call via a fast connection, for example. If the receiver
site stores a copy of the statistical model, it needs no transmission
capability, and the actual receiver can be a broadcast receiver, a pager
or a dedicated add-on card for a laptop computer, similar in appearance to
current GSM attachment cards for laptops.

Alternatively, the receiver may be part of a transceiver, such as a mobile
phone or a WLAN or Bluetooth interface attached to a portable or handheld
computer. In this case, the transceiver may send the measurement results
to the network which forwards the results to a location server. Depending
on the type of transceiver, the measurements can be sent in a short
message, via a data call or a WAP or WLAN connection, for example. The
location server can send the transceiver its location estimate over a
similar connection.

According to a preferred embodiment of the invention, the signal parameter
measurements (the calibration measurements and/or the receiver's current
observations) are quantified to a relatively small number of classes, such
as two to five classes. In other words, the granularity of the
measurements is increased (the scale is made coarser). At first sight,
such granularity increase seems to lose information. For instance, assume
that the signal strength of a certain channel at a certain location is 34
units on a scale of 0 to 100 (the actual unit is irrelevant). Instead of
storing the result of 34 units, we only store the fact that the
measurement was between 25 and 50, i.e. a value of 1 on a scale of 0 to 3.
It would seem that a value of 34 on a scale of 0 to 100 can better predict
the signal strength in the neighbourhood of that location than a value of
1 on a scale of 0 to 3 does. However, in many cases increased granularity
results in increased location accuracy. One reason for this is that on a
high-resolution scale, there are many values that occur relatively seldom,
whereas on a low-resolution scale, all possible values occur relatively
frequently.
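
The quantification step can be sketched as follows. Equal-width classes
and the treatment of boundary values (a value exactly on a boundary goes
to the higher class, and the top of the scale goes to the last class) are
assumptions; the text above only fixes the example of 34 units mapping to
class 1 on a scale of 0 to 3.

```python
def quantize(value, low=0.0, high=100.0, classes=4):
    """Map a measurement on [low, high] to a class index 0 .. classes-1."""
    if not low <= value <= high:
        raise ValueError("value outside the measurement scale")
    width = (high - low) / classes
    # The top of the scale belongs to the last class, not to a new one.
    return min(int((value - low) / width), classes - 1)


signal_class = quantize(34)  # 34 lies between 25 and 50
```

With four classes, many distinct raw values share each class, so every
class is observed often during calibration, which is the reason given
above for the improved accuracy.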

An advantage of the invention is that prior information on the network
infrastructure is not necessary (although it may be useful). This means
that a location service according to the invention is not tied to network
operators. Even if the location service according to the invention is
maintained by a network operator, that operator can exploit observations
from other operators' networks without prior information on their
infrastructure. The invention is applicable in a wide variety of network
techniques, such as cellular networks, broadcast networks or wireless
local-area networks.

Brief description of the drawings 

The invention will be described in more detail by means of preferred
embodiments with reference to the appended drawing wherein:

Figure 1 illustrates various graphs of signal parameter versus receiver
location;

Figure 2 is a block diagram illustrating the general concept of the
invention;

Figure 3 is a block diagram illustrating a typical calibration receiver
for determining calibration measurements;

Figures 4A and 4B are block diagrams illustrating mobile receivers whose
location is to be estimated; and

Figure 5 illustrates the structure of a statistical model.

Detailed description of the invention

Figure 1 illustrates various graphs of signal parameter versus receiver
location. The horizontal axis represents the (one-dimensional) location of
a receiver. The vertical axis represents a signal parameter V (such as
signal strength or error ratio) measured by a receiver. Graphs A and B
depict signal parameters of two channels. In this hypothetical example, we
have 10 data points D1 to D10 measured at locations X1 to X10,
respectively. Both graphs A and B share the data points D1 to D10 having
the respective locations X1 to X10 and the signal parameter value V0.
Figure 1 gives a faint idea of the difficulties in implementing the
invention. Not only is the parameter value V0 common to 10 different
locations (in this example), but the 10 locations could be explained
equally well by both graphs A and B. The well-known Nyquist criterion
states that a signal can be fully reconstructed if sampled at more than
twice its highest frequency component. If the graphs A and B represent,
say, field strength in a GSM network having a nominal frequency of
900 MHz, the spatial frequency of the graphs A and B has a wavelength of
approximately 30 cm. Accordingly, the signal parameters should be sampled
at points less than 15 cm apart, which is clearly an impossible task. But
if the signal parameters are sampled at points more than half a wavelength
apart, the graphs A and B cannot be reconstructed, as evidenced by the
fact that between points X6 and X10 the graphs A and B have no similarity
whatsoever.
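
The figures of roughly 30 cm and 15 cm follow from the relation wavelength
= c / f and the half-wavelength sampling bound. A small sketch of the
arithmetic (the function names are hypothetical):

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458


def wavelength_m(frequency_hz):
    """Free-space wavelength in metres for a carrier frequency in hertz."""
    return SPEED_OF_LIGHT_M_PER_S / frequency_hz


def max_sample_spacing_m(frequency_hz):
    """Nyquist bound on spatial sampling: points under half a wavelength apart."""
    return wavelength_m(frequency_hz) / 2


gsm_wavelength = wavelength_m(900e6)       # about 0.33 m
gsm_spacing = max_sample_spacing_m(900e6)  # about 0.17 m
```

The text rounds these to 30 cm and 15 cm. Either way, covering even a
single kilometre of road would take several thousand calibration points,
which is why dense sampling is dismissed as impractical.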

The present invention works in practice because, as the number of channels
increases, the number of locations where the channels behave as described
above decreases rapidly, and so it becomes increasingly unlikely that any
two points cannot be distinguished from each other based on the measured
parameters.
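
The effect can be illustrated with a toy example. The five locations and
their quantized per-channel values below are invented for illustration
only; the sketch counts how many pairs of locations remain
indistinguishable as more channels are taken into account.

```python
from itertools import combinations

# Hypothetical quantized signal classes (0..3) on four channels.
FINGERPRINTS = {
    "X1": (1, 2, 0, 3),
    "X2": (1, 2, 3, 0),
    "X3": (1, 0, 0, 3),
    "X4": (1, 2, 0, 1),
    "X5": (1, 0, 3, 2),
}


def ambiguous_pairs(fingerprints, n_channels):
    """Count location pairs that look identical on the first n_channels."""
    return sum(
        1
        for a, b in combinations(fingerprints.values(), 2)
        if a[:n_channels] == b[:n_channels]
    )


counts = [ambiguous_pairs(FINGERPRINTS, n) for n in range(1, 5)]
# counts == [10, 4, 1, 0]
```

On the first channel alone all five locations share the same value, as in
Figure 1, yet with four channels no two locations are confused.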
Figure 2 is a block diagram illustrating the general concept of the
invention. In Figure 2, the invention is implemented as a compact location
estimation module LEM, although more distributed implementations are
equally possible. An essential feature of the invention is a statistical
model SM of the receiver's wireless environment, the model being able to
predict the receiver's location given a plurality of current observations
at the receiver site. The statistical model SM is built and maintained by
a model construction module MCM, on the basis of calibration data CD and,
optionally, on the basis of prior information PI of the wireless
environment. The optional prior information PI may comprise information on
network infrastructure, such as the locations and radio parameters of base
stations. The locations at which calibration measurements are collected
are called calibration points. The calibration data CD comprises data
records each of which comprises the location X of the calibration point in
question and the set of signal parameters V measured at that calibration
point. Optionally, the calibration data records may also comprise the time
at which the measurement was made, in case the signal parameters vary with
time. The location X can be expressed in any absolute or relative
coordinate system. In special cases, such as trains, highways, tunnels,
waterways or the like, a single coordinate may be sufficient, but normally
two or three coordinates will be used. The reference sign X denotes the
set of all coordinates of the location.
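
A calibration data record as described above could be represented as
follows. The class name, field names, channel keys and example values are
assumptions chosen for illustration; only the content of the record
(location X with one to three coordinates, signal parameters V, optional
time) comes from the text.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class CalibrationRecord:
    """One record of the calibration data CD (hypothetical layout)."""
    location: Tuple[float, ...]   # X: one to three coordinates
    parameters: Dict[str, float]  # V: one measured value per channel
    time: Optional[float] = None  # optional time of measurement

    def __post_init__(self):
        if not 1 <= len(self.location) <= 3:
            raise ValueError("location must have one to three coordinates")


record = CalibrationRecord(
    location=(60.17, 24.94),
    parameters={"ch1": 34.0, "ch2": 71.0},
)
```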

It should be noted that the term 'training data' is often used in the
context of such statistical models. In the context of this invention, the
term 'calibration' is preferred, because 'training' may convey the idea
that the model is ready after initial training, whereas 'calibration'
better conveys the idea that the model may have to be updated regularly as
the conditions change.

There is also a location calculation module LCM for producing a location
estimate LE on the basis of the receiver's current observations CO and the
statistical model SM. Technically, the 'measurements' and 'observations'
can be performed similarly, but to avoid confusion, the term 'measurement'
is generally used for the calibration measurements, and the signal
parameters obtained at the current location of the receiver are called
'observations'. The receiver's most recent set of observations is called
current observations. The location calculation module LCM or a separate
estimate interpretation module EIM may also use the receiver's observation
history OH to interpret the location estimate. In other words, the
observation history OH can be used to resolve ambiguities in cases where a
set of observations can be explained by two or more locations with
substantially equal probability.

Figure 3 is a block diagram illustrating a typical calibration receiver CR
for determining the calibration measurements in the calibration data CD
shown in Figure 2. Figure 3 shows a mobile calibration receiver comprising
a portable computer (or data processor) PC-C, a mobile station MS-C (such
as a GSM, GPRS or UMTS mobile phone) and a location receiver LR, such as a
GPS (global positioning system) device. The suffixes -C stand for
calibration receiver, to distinguish these parts from the corresponding
parts of the actual receiver R in Figure 4. For clarity, the calibration
receiver's main modules PC-C, MS-C and LR are shown separately, although
the two latter modules are available as PC cards which can be inserted
into a card socket in a typical laptop computer. The calibration receiver
CR observes the radio signal parameters of the available base stations BS
in a cellular radio network RN. The interface between the radio network RN
and the mobile station MS-C is called a radio interface RI. If the radio
interface RI is bidirectional, the calibration receiver CR may send its
observations to the location estimation module LEM via the same radio
interface RI. Alternatively, the calibration receiver's portable computer
PC-C may store the observations on a detachable memory DM medium, such as
a recordable CD-ROM disk, which is later brought off-line to the location
estimation module LEM.

The location receiver LR of the calibration receiver CR can be entirely
conventional, for example a commercial GPS (global positioning system)
receiver, as long as it can output the measured coordinates to an attached
computer or other data processor. The portable computer can also be a
conventional, suitably programmed computer. Only the mobile station MS-C
may need modifications to its hardware or firmware (its ROM contents).
Modifications may be needed, depending on how many signal parameters the
mobile station measures. For example, a conventional GSM phone monitors,
in addition to its currently active cell, some parameters of its
neighbouring cells, but the neighbouring cells are not measured as
extensively as the active cell. Only when a GSM phone is having an active
call does it monitor the neighbouring cells as extensively as its active
cell. For the purposes of the invention, it would be beneficial to modify
the mobile station's cell monitoring routines such that it monitors the
available cells as extensively as possible.

Naturally, the calibration receiver CR can comprise more than one mobile
station for monitoring different types of networks or different operators'
networks. For monitoring broadcast networks, the calibration receiver CR
should also comprise a scanning broadcast receiver (not shown separately).
Alternatively, the mobile station MS can be a multi-mode device capable of
receiving cellular networks and broadcast networks.

Calibration receivers, like the one shown in Figure 3, can be carried
along in vehicles or with persons. Fixed calibration receivers, which do
not need a GPS receiver, can be attached to buildings, traffic signs,
lampposts and the like. As an alternative to using a separate location
receiver, the location of the calibration receiver can be determined by
one or more of the following techniques: showing the receiver's location
on a digitized map; entering a street (or other) address and converting it
to a location by means of a suitable database; or using other known
locations, such as stops of public vehicles.

Figure 4A is a block diagram illustrating a typical mobile receiver whose
location is to be estimated. A simple embodiment of a receiver R comprises
only a suitably programmed mobile station MS. For some embodiments, the
receiver R may also comprise a portable computer (or data processor) PC.
Again, the term 'receiver' implies that the device is receiving when its
location is being estimated although, in practice, most embodiments will
also have transmitting capability. The embodiment shown in Figure 4A does
not contain the statistical model SM. Accordingly, the receiver R must
send its current observation set CO to the location estimation module LEM
via the base station BS it is connected to. The location estimation module
LEM returns the receiver its location estimate LE via the radio interface
RI.

Figure 4B shows an alternative embodiment in which the receiver's attached
computer PC receives a copy of the statistical model SM on a detachable
memory DM, such as a CD-ROM disk, and the receiver is able to determine
its own location without transmitting anything. As a yet further
alternative (not shown separately), the receiver's attached computer PC
may receive the statistical model via an Internet (or any other data)
connection to the location estimation module LEM. Future wideband mobile
stations may be able to receive the statistical model via the radio
interface RI. A hybrid of the technologies may also be used such that the
receiver receives an initial statistical model via a wired connection or
on the detachable memory, but later updates to the model are sent via the
radio interface.

Note that in Figures 3, 4A and 4B, the radio network RN is shown as a
cellular network and the mobile stations MS resemble cellular handsets.
The invention is not limited to cellular networks, however, and can
equally well be used in a WLAN environment, in which case the mobile
stations are replaced by WLAN interface devices.

Statistical modelling 

Possible statistical models will now be studied in more detail. In
general, a statistical model, as used in this context, can comprise
several individual statistical submodels, in which case the actual
estimate is obtained by combining the individual results of the submodels.

There are many possible statistical modelling approaches that can be used
for producing the required statistical submodels. In the following we will
focus on the probabilistic approach. A probabilistic model means that when
estimating the location of the mobile terminal, the result is represented
as a probability distribution over the possible locations if location X is
modelled as a discrete variable, whereas, if the location X is modelled as
a continuous variable, the result is represented as a density function. In
the following, the focus will be on the discrete case. Similarly, the
location-dependent measurements V can also be modelled either with
discrete or continuous observational variables. The number of dimensions
of vector V (the number of measurements that can be obtained) varies and
depends on the properties of the operating wireless network(s).

Again, there are many probabilistic model classes that can be used. In the
following preferred embodiment of the invention, the focus will be on
parametric probabilistic models. In this case a single model can be
represented as a pair $(M, \theta)$, where $M$ denotes the model
structure, i.e. the qualitative properties of the model that determine
which parameters are required, and $\theta$ denotes the quantitative
values of the parameters.

There are two principal approaches for constructing parametric
probabilistic models $(M, \theta)$ in the present context, namely
conditional models and joint models. Conditional models are models that
directly give probability distributions of the form
$P(X \mid V, M, \theta)$, where V denotes the values of the observational
variables (for example, a vector consisting of signal strength
measurements), and X denotes the location where observation V was made.
Joint models define probability distributions $P(X, V \mid M, \theta)$ on
events (X, V). However, by using the axioms of probability theory we can
see that

$$P(X \mid V, M, \theta) = \frac{P(X, V \mid M, \theta)}{P(V \mid M, \theta)},$$

where $P(V \mid M, \theta)$ does not depend on the location X. Thus we can
treat the denominator $P(V \mid M, \theta)$ as a normalizing constant.
This means that we can always use a joint model for conditional modelling,
and in the following we will focus on joint modelling and regard
conditional modelling as a special case.
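
The normalization step can be sketched for the discrete case. The joint
table below is invented for illustration, with hypothetical locations "A"
and "B" and a single quantized observation; the function simply divides
the matching joint probabilities by their sum, which plays the role of
the denominator P(V | M, θ).

```python
def posterior_over_locations(joint, observation):
    """Return P(X | V = observation) from a joint table P(X, V)."""
    unnormalized = {x: p for (x, v), p in joint.items() if v == observation}
    evidence = sum(unnormalized.values())  # P(V): the same for every X
    if evidence == 0:
        raise ValueError("observation has zero probability under the model")
    return {x: p / evidence for x, p in unnormalized.items()}


# Hypothetical joint distribution over (location, observed class).
JOINT = {
    ("A", "weak"): 0.1,
    ("A", "strong"): 0.4,
    ("B", "weak"): 0.3,
    ("B", "strong"): 0.2,
}

posterior = posterior_over_locations(JOINT, "strong")
```

For the observation "strong" the posterior favours location "A" by two to
one, because the joint probabilities 0.4 and 0.2 keep their ratio after
normalization.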

There are many ways to use parametric models in location estimation. Let
us first assume that we have decided to use a single model structure M,
and we wish to determine the parameters from the calibration data CD
(denoted D in the formulas) so that we get a joint probabilistic model for
events (X, V), which also gives, as described above, the required
conditional distribution for location X given the observations V. As
described in [Kontkanen et al. 2000], there are several alternatives for
producing the joint distribution:

1. We can use P(X, V | M, θ(D)), where θ(D) is the maximum likelihood
instantiation of the parameters, ie θ(D) = arg max P(D | M, θ).

2. We can use P(X, V | M, θ(D)), where θ(D) is the Bayesian maximum posterior
instantiation of the parameters, ie θ(D) = arg max P(θ | M, D).

3. We can use P(X, V | M, θ(D)), where θ(D) is the mean of the posterior
distribution P(θ | M, D).

4. We can integrate over the parameters θ:
P(X, V | D, M) = ∫ P(X, V | D, M, θ) P(θ | D, M) dθ.

5. We can use P(X, V | M, θ(D)), where θ(D) is the parameter instantiation
optimizing the minimum message length criterion described in [Wallace and
Dowe 1999] and the references therein.

In some special cases, alternatives 3 and 4 are equivalent.
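
To make alternatives 1 to 3 concrete, the following sketch assumes a single multinomial variable with a Dirichlet prior; this choice of distribution, the function names and the counts are illustrative assumptions. For this particular case the predictive probability of one further observation obtained by integrating over the parameters (alternative 4) coincides with plugging in the posterior mean (alternative 3).

```python
def ml_estimate(counts):
    """Alternative 1: maximum likelihood parameters of a multinomial."""
    n = sum(counts)
    return [c / n for c in counts]


def map_estimate(counts, alpha):
    """Alternative 2: mode of the Dirichlet posterior Dirichlet(counts + alpha).
    The closed form below requires every counts[i] + alpha[i] > 1."""
    post = [c + a for c, a in zip(counts, alpha)]
    if any(p <= 1 for p in post):
        raise ValueError("posterior mode formula needs counts[i] + alpha[i] > 1")
    denom = sum(post) - len(post)
    return [(p - 1) / denom for p in post]


def posterior_mean(counts, alpha):
    """Alternative 3: mean of the Dirichlet posterior. For one further
    observation this equals the predictive distribution of alternative 4."""
    post = [c + a for c, a in zip(counts, alpha)]
    total = sum(post)
    return [p / total for p in post]


counts = [6, 3, 1]   # hypothetical counts of three discretized signal values
alpha = [2, 2, 2]    # hypothetical Dirichlet prior
theta_ml = ml_estimate(counts)
theta_map = map_estimate(counts, alpha)
theta_mean = posterior_mean(counts, alpha)
```

With little calibration data the three estimates differ noticeably; they converge as the counts grow.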

In general, one may wish to use several model structures M. In the following,
we will assume that we have fixed the general model family (set) F, the set of
all the possible model structures under consideration. For example, the set F
may correspond to the set of all possible Bayesian network models (see [Cowell
et al. 1999], [Pearl 1988]). In this case we produce the predictive
distribution P(X | V, F) by computing a weighted sum over all the models in F:
P(X | V, F) ∝ Σ_M P(X, V | M) W(M). Possible weighting functions W include the
following:

1. The posterior of the model structure M, given the data:
P(M | D) ∝ P(D | M) P(M) = P(M) ∫ P(D | θ, M) P(θ | M) dθ.

2. The stochastic complexity of the data, given the model structure M, and the
approximations of the stochastic complexity criterion, discussed in
[Rissanen 1999] and the references therein.

3. The minimum message length of the data, given the model structure M, and
the approximations of the MML criterion, discussed in [Wallace and Dowe 1999]
and the references therein.

It is also possible to use conditional (supervised) versions of the weighting
functions, in which case the weights are computed with respect to conditional
modelling, and the actual data is taken to consist of only the values of the
location variable X, and the measurement data V is treated as "background
data". These alternatives are discussed in [Kontkanen et al. 1999].
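
The weighted sum over model structures can be sketched as follows. The model names, joint values and weights are hypothetical, and the weights W(M) are assumed to have been computed beforehand with one of the weighting functions listed above.

```python
def averaged_posterior(joint_by_model, weights):
    """Compute P(X | V, F) proportional to sum over M of P(X, V | M) * W(M).

    joint_by_model maps a model name to {location: P(X, V=v | M)} for one
    fixed observation v; weights maps a model name to W(M)."""
    locations = next(iter(joint_by_model.values())).keys()
    scores = {
        x: sum(weights[m] * joint_by_model[m][x] for m in joint_by_model)
        for x in locations
    }
    total = sum(scores.values())
    return {x: s / total for x, s in scores.items()}


# Two hypothetical model structures evaluated at the same observation.
joint_by_model = {
    "coarse": {"a": 0.10, "b": 0.30},
    "fine": {"a": 0.20, "b": 0.20},
}
weights = {"coarse": 0.75, "fine": 0.25}
averaged = averaged_posterior(joint_by_model, weights)
```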

If the number of model structures M in F is too high for computing the
weighted sum in a feasible time, we have to restrict the model family F by
performing a search in F, and pruning F to consist of only those model
structures that are the best with respect to some cost function. The possible
cost functions for performing the search include the weight functions listed
above. Any search algorithm can be exploited in this task. An extreme case of
this type of restricting search is a case where only one single model
structure M in F is chosen. In other words, the sum over model structures
reduces to a single term corresponding to the use of a single model with the
largest weight.
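
A minimal sketch of the pruning step, assuming the weights of the candidate model structures visited by the search are already available (the search itself is not shown, and all names and numbers are hypothetical):

```python
def prune_family(weights, keep):
    """Keep the `keep` model structures with the largest weights and
    rescale the surviving weights so that they sum to one."""
    best = sorted(weights, key=weights.get, reverse=True)[:keep]
    total = sum(weights[m] for m in best)
    return {m: weights[m] / total for m in best}


weights = {"m1": 0.5, "m2": 0.3, "m3": 0.15, "m4": 0.05}
top_two = prune_family(weights, keep=2)
single = prune_family(weights, keep=1)   # extreme case: one model structure
```

With `keep=1` the weighted sum reduces to the single model with the largest weight.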

If the observations V are modelled as discrete variables, the granularity of
the discrete variables can be viewed as part of the model structure M. The
granularity can either be fixed by the user (representing prior information),
or as part of the model structure M, it can be learned from the calibration
data.

The optional prior information, such as information on the locations and radio
parameters of the base stations, represents knowledge other than that
extracted from the calibration measurements. In the probabilistic setting, we
can identify the following ways for coding the prior information:

1. By choosing the initial model family F of probability models (determining
the model structures considered, and with each model structure, the forms of
the distributions used and the assumptions made).

2. If the observational variables V are taken to be discrete, by choosing the
granularity of the discretization.

3. If the location variable X is taken to be discrete, by choosing the
granularity of the discretization.

4. By determining the prior distribution P(θ | M) for the parameters of the
model M.

5. By determining the prior distribution P(M) for the model structures M in
the family F.

Missing data

There are several alternative procedures for handling missing data: 

1. Treat 'missing' as an extra value for the variable in question.

2. Ignore the missing entries (the sufficient statistics are computed from the
existing data only).

3. Estimate the missing values from the existing data and/or prior
information. The estimates can either be used for filling in educated guesses
of the missing values, or they can be treated as partial observations
(sufficient statistics of several possible values can be simultaneously
partially updated, according to, e.g., their estimated probabilities).

4. Fill in the missing values by using random guesses.
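
The first three procedures can be sketched on the count statistics of one discrete variable. This is an illustrative sketch only: missing entries are represented by `None`, and in the partial-update variant the probabilities of the missing values are assumed to be estimated from the observed relative frequencies.

```python
from collections import Counter


def counts_missing_as_value(values):
    """Procedure 1: 'missing' becomes an extra value of the variable."""
    return Counter("missing" if v is None else v for v in values)


def counts_ignoring_missing(values):
    """Procedure 2: sufficient statistics from the existing data only."""
    return Counter(v for v in values if v is not None)


def counts_with_partial_updates(values):
    """Procedure 3 (partial observations): each missing entry adds a
    fractional count to every value, in proportion to the observed
    relative frequencies (an assumed estimator)."""
    observed = counts_ignoring_missing(values)
    n_observed = sum(observed.values())
    if n_observed == 0:
        raise ValueError("no observed values to estimate from")
    n_missing = sum(1 for v in values if v is None)
    return {v: c + n_missing * c / n_observed for v, c in observed.items()}


values = ["low", "high", "high", None]   # hypothetical discretized readings
as_value = counts_missing_as_value(values)
ignored = counts_ignoring_missing(values)
partial = counts_with_partial_updates(values)
```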

Location interpretation and reporting 

The result of the probabilistic location estimation can be reported in several
different ways. First, we can divide the working area into several subareas in
different ways: the subareas can either form a full partitioning of the work
area, or they can cover only a portion of the whole work area. An example of
the latter case is that only the locations listed in the calibration data D
(with a desired accuracy) are considered. The result of the probabilistic
location estimation can now be reported either

1. By giving the full probability distribution over the areas, ie, for each
area X, give the corresponding probability P(X | V, F).

2. By giving the most probable subarea X with respect to the distribution
P(X | V, F).

3. By giving a point estimate minimizing the expected value of some error
function with respect to the distribution P(X | V, F).

An example of an error function for alternative 3 is the mean squared error,
in which case the point estimate is the weighted average of the centre points
of the subareas (assuming that the subareas are of equal size), the weights
being the probabilities P(X | V, F). If the subareas X are not of equal size,
the weights can be rescaled with respect to the relative size of the
corresponding subarea, for example by multiplication.
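
The reporting alternatives 2 and 3 can be sketched as follows, given the full distribution of alternative 1 as a dictionary. The subarea names, centre coordinates and the optional `areas` argument are hypothetical; the rescaling by multiplication follows the example given above.

```python
def most_probable(posterior):
    """Alternative 2: the subarea with the highest probability."""
    return max(posterior, key=posterior.get)


def mean_point(posterior, centres, areas=None):
    """Alternative 3 with squared error: probability-weighted average of
    the subarea centre points. If `areas` is given, each weight is
    rescaled by multiplication with the size of its subarea."""
    weights = {
        x: p * (areas[x] if areas is not None else 1.0)
        for x, p in posterior.items()
    }
    total = sum(weights.values())
    east = sum(w * centres[x][0] for x, w in weights.items()) / total
    north = sum(w * centres[x][1] for x, w in weights.items()) / total
    return east, north


posterior = {"a": 0.5, "b": 0.25, "c": 0.25}
centres = {"a": (0.0, 0.0), "b": (10.0, 0.0), "c": (0.0, 20.0)}
best = most_probable(posterior)
point = mean_point(posterior, centres)
```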
Uncertainty about the receiver's location can be reduced by prior information
PI, if available, and/or the observation history OH. Let us assume that the
above alternative 1 was chosen initially. In other words, the user or
application requesting the location of the receiver is reported the full
probability distribution. The probability distribution may indicate a number
of feasible locations. The prior information PI, if available, may indicate
that only one of the locations is possible, given the received cell
identifiers or the like. Alternatively, the observation history OH can be used
to exclude some locations. For example, although a number of locations could
explain the receiver's current location, only a subset of the locations can
explain the entire observation history OH, given the receiver's finite speed.
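
One possible way to use the observation history for excluding locations is sketched below. It assumes that each past observation has already been reduced to a set of feasible locations, and that a hard speed limit applies; the speed limit, the time step and all names are hypothetical.

```python
import math


def filter_by_history(candidate_sets, coords, max_speed, dt):
    """Return the current locations that are reachable along some path
    through the earlier candidate sets without exceeding max_speed.

    candidate_sets: list of iterables of location names, oldest first.
    coords: location name -> (x, y) in meters.
    max_speed: assumed speed limit in meters per second.
    dt: assumed time between consecutive observations in seconds."""
    reach = max_speed * dt
    feasible = set(candidate_sets[0])
    for candidates in candidate_sets[1:]:
        feasible = {
            c for c in candidates
            if any(math.dist(coords[c], coords[p]) <= reach for p in feasible)
        }
    return feasible


coords = {"a": (0.0, 0.0), "b": (5.0, 0.0), "c": (100.0, 0.0)}
history = [["a"], ["b", "c"]]   # "c" also explains the latest observation
current = filter_by_history(history, coords, max_speed=2.0, dt=5.0)
```

Here location "c" is excluded because it lies 100 meters from the only earlier candidate, which cannot be covered in 5 seconds at 2 meters per second.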

Performance examples 

Example 1: Location estimation with the Naïve Bayes model.

The subareas X under consideration are the locations where the calibration
data was collected. The radius of the locations is assumed to be one meter,
although any unit can be used. The observational variables V are taken to be
discrete with m values. The value of m can be a constant (eg 3), or it can be
optimized by using one of the weighting functions described above. The
boundary points between the intervals can be determined so that the number of
training samples within each interval is the same (equal-frequency
discretization), or alternatively, the intervals can be made of equal width
(equal-width discretization). The intervals can also be determined by using a
clustering algorithm, such as the K-means algorithm.
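
The two simpler discretization schemes can be sketched as follows. The signal range, the sample values and the function names are illustrative assumptions; with tied sample values the equal-frequency intervals are only approximately equally populated.

```python
import bisect


def equal_width_boundaries(lo, hi, m):
    """The m - 1 boundary points splitting [lo, hi] into m equal-width intervals."""
    step = (hi - lo) / m
    return [lo + step * i for i in range(1, m)]


def equal_frequency_boundaries(samples, m):
    """The m - 1 boundary points chosen so that each interval holds
    (about) the same number of training samples."""
    ordered = sorted(samples)
    n = len(ordered)
    return [ordered[(n * i) // m] for i in range(1, m)]


def discretize(value, boundaries):
    """Index (0-based) of the interval that contains the value."""
    return bisect.bisect_right(boundaries, value)


samples = [-95, -90, -85, -70, -65, -50]      # hypothetical readings in dBm
width_bounds = equal_width_boundaries(-100, -40, 3)
freq_bounds = equal_frequency_boundaries(samples, 3)
bins = [discretize(s, freq_bounds) for s in samples]
```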

One model structure M is used: the observational variables V_1, ..., V_n are
assumed to be independent, given the value of the location variable X. The
data is assumed to be independently and identically distributed (= "i.i.d."),
and follow the Multinomial distribution with the assumptions described in
[Geiger and Heckerman, 1998]. Prior information is non-existent. A
non-informative uniform prior distribution for the model parameters is used.
Alternatives are discussed in [Kontkanen et al., 2000]. The distribution
P(X, V | D, M) is computed by integrating over the parameters. With the above
assumptions, this can be done as described in [Kontkanen et al., 2000].

In this experiment, the observation history OH is taken into account by
treating the eight (other numbers are equally possible) last signal
measurements as a single measurement vector V so that the eight individual
measurements are assumed to be independent of each other. The result is given
as a point computed as a weighted average of the centre points of the
subareas, where the weight for area X is P(X | V, D, M).

This method was implemented and tested empirically in downtown Helsinki by
exploiting the signal strengths of the Sonera GSM network. The work area was
approximately 400 x 500 meters in size, and the calibration data was collected
outside in the streets at approximately 50 evenly distributed points. The
average distance between two measurement locations was approximately 50
meters. The system was tested by using the location estimator in 300 randomly
situated locations within the work area. The average location error in this
test was 42 meters.
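
The estimator of Example 1 can be sketched as below. This is a simplified illustration, not the tested implementation: it assumes that the signal values are already discretized into m bins, that uniform Dirichlet priors are used for all parameters (so that integrating over the parameters yields add-one smoothed frequencies), and that the observation vector holds one value per channel. All names and data are hypothetical.

```python
from collections import defaultdict


def train(calibration, m):
    """calibration: list of (location, [bin index per channel]);
    m: number of discrete values per observational variable."""
    loc_counts = defaultdict(int)
    val_counts = defaultdict(int)          # (location, channel, bin) -> count
    for loc, obs in calibration:
        loc_counts[loc] += 1
        for channel, b in enumerate(obs):
            val_counts[(loc, channel, b)] += 1
    return dict(loc_counts), dict(val_counts), m


def posterior(model, obs):
    """P(X | V, D, M) under the conditional independence assumption."""
    loc_counts, val_counts, m = model
    n_total, k = sum(loc_counts.values()), len(loc_counts)
    scores = {}
    for loc, n_loc in loc_counts.items():
        p = (n_loc + 1) / (n_total + k)                      # P(X | D)
        for channel, b in enumerate(obs):
            p *= (val_counts.get((loc, channel, b), 0) + 1) / (n_loc + m)
        scores[loc] = p
    total = sum(scores.values())
    return {loc: s / total for loc, s in scores.items()}


def point_estimate(post, centres):
    """Weighted average of the centre points, weights P(X | V, D, M)."""
    east = sum(p * centres[loc][0] for loc, p in post.items())
    north = sum(p * centres[loc][1] for loc, p in post.items())
    return east, north


calibration = [("A", [0, 2]), ("A", [0, 2]), ("B", [2, 0]), ("B", [2, 1])]
model = train(calibration, m=3)
post = posterior(model, [0, 2])
estimate = point_estimate(post, {"A": (0.0, 0.0), "B": (50.0, 0.0)})
```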

Example 2: Location estimation with a Mixture of Histograms model. 

The location variable X is taken to consist of two coordinates (one or three
coordinates are also possible) on a fine-grained, discrete scale. The
resolution of the scale can be, for instance, one meter. The observational
variables V_1, ..., V_n are taken to be discrete with the maximum resolution
determined by the measuring device, e.g. 1 dBm. We denote the combination of
V_1, ..., V_n by V. Missing values are replaced by a value which is smaller
than any possible observable value. Several models are considered. Each model
M_kl is associated with parameters k, l, and θ_kl whose semantics is described
below.

Figure 5 illustrates the structure of model M_kl. The value of variable X_k is
obtained from the value of variable X via discretization into k values. The
conditional distribution of variables V_1(l), ..., V_n(l) given the value of
X_k is described by model parameters θ_kl. Each V_i, where i belongs to the
set {1, ..., n}, is uniformly distributed within the interval defined by the
value of variable V_i(l). A low-resolution location variable X_k is derived
from the fine-grained location variable X by discretizing the latter with
equal-width discretization (other discretization methods are also possible)
using k bins, ie k possible values. Whenever a boundary point of the
low-resolution discretization appears between two boundary points of the
fine-grained discretization, the mass (i.e. the number of observations within
the sub-interval) is divided according to the relative size of the overlapping
intervals. For instance, let the fine-grained discretization have 5 bins (4
boundary points) within the range [0, 10]. Let the low-resolution
discretization have 2 bins, and therefore one boundary point at the value 5.
If there are n observations within the range [4, 6], i.e. within the third bin
of the fine-grained discretization, then both low-resolution bins get n/2
observations, because the boundary point 5 splits the range [4, 6] into two
parts of equal size. Likewise, each observational variable V_i is discretized
using l possible values, thus obtaining a low-resolution variable V_i(l).
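
The division of mass between overlapping intervals can be sketched for one dimension as follows. The function name is hypothetical, and both discretizations are assumed to be equal-width over the same range.

```python
def rebin(counts, lo, hi, k):
    """Redistribute the counts of an equal-width fine-grained discretization
    of [lo, hi] into k equal-width low-resolution bins. A fine bin that
    straddles a low-resolution boundary is split in proportion to the
    size of the overlapping intervals."""
    fine_width = (hi - lo) / len(counts)
    coarse_width = (hi - lo) / k
    result = [0.0] * k
    for i, count in enumerate(counts):
        fine_lo = lo + i * fine_width
        fine_hi = fine_lo + fine_width
        for j in range(k):
            coarse_lo = lo + j * coarse_width
            coarse_hi = coarse_lo + coarse_width
            overlap = min(fine_hi, coarse_hi) - max(fine_lo, coarse_lo)
            if overlap > 0:
                result[j] += count * overlap / fine_width
    return result


# The example from the text: 5 fine bins on [0, 10], 2 low-resolution bins,
# and n = 8 observations in the third fine bin [4, 6].
split = rebin([0, 0, 8, 0, 0], 0.0, 10.0, 2)
mixed = rebin([1, 2, 8, 3, 4], 0.0, 10.0, 2)
```

Each low-resolution bin receives n/2 = 4 observations from the third fine bin, as in the example above.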

Model M_kl describes the conditional probability functions
P(V(l) | X_k, M_kl, θ_kl), where θ_kl denotes the model parameters of model
M_kl. The low-resolution observational variables V_1(l), ..., V_n(l) are taken
to be independent, given the value of the location variable X_k. For each i
belonging to the set {1, ..., n}, the distribution P(V_i(l) | X_k, M_kl, θ_kl)
is taken to be i.i.d., and follow the Multinomial-Dirichlet distribution with
the assumptions described in [Geiger and Heckerman, 1998]. Prior information
is nonexistent. A uniform prior distribution over models M_kl is used. A
non-informative equivalent sample size (ESS) prior distribution for the model
parameters is used as described in [Heckerman, 1995]. A second-order prior for
the ESS parameter is used, e.g., a uniform distribution over the set {1, 10}.
For each i belonging to the set {1, ..., n}, the distribution
P(V_i(l) | X_k, M_kl) is computed by integrating over the model parameters.
With the above assumptions, this can be done as described in [Kontkanen et
al., 2000].

The distribution P(V_i | V_i(l)) is taken to be uniform over the interval
defined by the value of V_i(l) and the discretization of V_i defined by
parameter l. For instance, let the range of V_i be [0, 10], let the value of l
be 5, and let the value of V_i(l) be 2. Assuming that equal-width
discretization is used, the values of V_i are discretized into five intervals,
[0, 2], [2, 4], [4, 6], [6, 8], and [8, 10]. Now, given that the value of
V_i(l) is 2, the distribution P(V_i | V_i(l)) is uniform over the interval
[2, 4]. The variables V_1, ..., V_n are taken to be independent of each other
given the values of variables V_1(l), ..., V_n(l).
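
The uniform distribution within an interval can be sketched as below. The function names are hypothetical, and the values of V_i(l) are assumed to be counted from 1, which is the reading that reproduces the interval [2, 4] in the example.

```python
def interval_for(value, lo, hi, l):
    """Interval of V_i selected by the low-resolution value V_i(l),
    with equal-width discretization of [lo, hi] into l intervals.
    `value` is counted from 1 (an assumption matching the example)."""
    width = (hi - lo) / l
    return lo + (value - 1) * width, lo + value * width


def uniform_density(v, value, lo, hi, l):
    """Density of P(V_i = v | V_i(l) = value): constant inside the
    selected interval and zero outside it."""
    a, b = interval_for(value, lo, hi, l)
    return 1.0 / (b - a) if a <= v <= b else 0.0


selected = interval_for(2, 0.0, 10.0, 5)
inside = uniform_density(3.0, 2, 0.0, 10.0, 5)
outside = uniform_density(5.0, 2, 0.0, 10.0, 5)
```

Inside the interval the density equals l / (hi - lo), so for a fixed range it depends only on l.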

Combining the two distributions P(V(l) | X_k, M_kl) and P(V | V(l)), we obtain
a conditional distribution P(V | X_k, M_kl). The distribution P(V | X, D) is
computed as a weighted average over the models M_kl, where k and l vary over
the set {1, 20} (other choices equally possible). The models are weighted by
the marginal likelihood P(V(D) | X_k(D), M_kl), where the calibration data
(with n observations) consists of the vectors V(D) = (V^1(D), ..., V^n(D)) and
X_k(D) = (X_k^1(D), ..., X_k^n(D)).

With these assumptions, the marginal likelihood can be computed efficiently in
two parts: First, the product of the terms with the form P(V(l) | X_k, M_kl)
can be computed as described in [Heckerman, 1995] and [Geiger and Heckerman,
1998]. Second, the terms with the form P(V | V(l)) have the same value, which
is a constant depending on l, because the distribution P(V | V(l)) is uniform.
The result is given as a posterior probability distribution
P(X | V, D) = P(V | X, D) P(X | D) / P(V | D) over the location variable X.
The distribution P(X | D) is taken to be uniform. The term P(V | D) is a
normalizing factor whose value is ignored. Instead the resulting distribution
P(X | V, D) is normalized so that it sums up to one.
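
For the first part of the marginal likelihood, the standard closed form for one multinomial variable with a Dirichlet prior can be sketched as follows. This is a simplified illustration: the equivalent sample size is assumed to be spread evenly over the l values of a single variable at a single location, whereas the exact allocation follows the cited references. The function name is hypothetical.

```python
import math


def log_marginal_likelihood(counts, ess):
    """Log probability of an observed sequence with the given value counts,
    integrating a multinomial over a symmetric Dirichlet prior whose
    hyperparameters sum to `ess` (assumed evenly spread)."""
    alpha = ess / len(counts)
    n = sum(counts)
    result = math.lgamma(ess) - math.lgamma(ess + n)
    for c in counts:
        result += math.lgamma(alpha + c) - math.lgamma(alpha)
    return result


# Two values, one observation of each, ESS = 2 (one pseudo-count per value):
# the first observation has probability 1/2, the second 1/3, product 1/6.
value = math.exp(log_marginal_likelihood([1, 1], 2.0))
```

Working in log space avoids numerical underflow when the products over all locations and channels are formed.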

The method described above was implemented and tested empirically in Helsinki
on the second floor of the building at address Teollisuuskatu 23 by using a
laptop computer measuring the WLAN signal strengths through a WLAN PC card.
The work area was approximately 20 x 45 meters (900 square meters) in size.
Calibration data was collected in 12 arbitrary places, and the total number of
data vectors in it was 204. The system was tested by using the location
estimator in 25 randomly chosen locations within the work area. Location
estimation was repeated five times in each location. When the above system was
used for determining the location area with 95 % of the probability mass, the
correct place was in this area 77 % of the time. The average size of the 95 %
probability mass area was approximately 151 square meters, i.e. about 17 % of
the total area.
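
The text does not state how the 95 % probability mass area was constructed. One common choice, shown here purely as an assumption, is to collect the most probable locations until their cumulative probability reaches the requested mass:

```python
def probability_mass_region(posterior, mass=0.95):
    """Smallest set of locations, taken in order of decreasing probability,
    whose total probability is at least `mass`. Returns the locations and
    the probability mass actually covered."""
    region, covered = [], 0.0
    ranked = sorted(posterior.items(), key=lambda item: item[1], reverse=True)
    for location, p in ranked:
        if covered >= mass:
            break
        region.append(location)
        covered += p
    return region, covered


posterior = {"a": 0.6, "b": 0.3, "c": 0.06, "d": 0.04}   # hypothetical values
region, covered = probability_mass_region(posterior)
```

The size of the reported area is then the combined size of the selected locations.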
Claims

1. A method for estimating a location (X) of a receiver (R, R') in a wireless
telecommunication environment (RN), the telecommunication environment
comprising a plurality of channels for simultaneous communication, each
channel having at least one signal parameter (V) that varies with location (X)
differently from the other channels;

characterized in that the method comprises the steps of:

for each of a plurality of calibration points in the wireless
telecommunication environment, determining a set of calibration data (CD),
each set of calibration data comprising the location (X) of the respective
calibration point and at least one measured signal parameter (V) for each of
several channels at that calibration point;

on the basis of the sets of calibration data (CD), maintaining a statistical
model (SM) of the signal parameters (V) of the several channels versus a
receiver's location in the wireless telecommunication environment (RN);

determining a set of observed signal parameters (CO), the set comprising at
least one observed signal parameter (V) for each of several channels at the
location (X) of the receiver (R, R'); and

determining a location estimate (LE) approximating the location (X) of the
receiver (R, R') on the basis of the statistical model (SM) and the set of
observed signal parameters (CO).

2. A method according to claim 1, characterized by the receiver (R) sending
the set of observed signal parameters (CO) to an external location estimation
module (LEM) which sends the location estimate (LE) to the receiver.

3. A method according to claim 1, characterized by the re- 
ceiver (R') storing a copy of the statistical model (SM) and determining the lo- 
cation estimate (LE) on the basis of the copy of the statistical model (SM). 

4. A method according to any one of the preceding claims, characterized by
maintaining the statistical model (SM) also on the basis of prior information
(PI) about the wireless environment's (RN) infrastructure.

5. A method according to any one of the claims 1 to 4, characterized in that
the statistical model (SM) is or comprises a probabilistic model, preferably a
Bayesian model.

6. A method according to claim 5, characterized in that the statistical model
(SM) is or comprises a Bayesian network model.

7. A method according to any one of the preceding claims, characterized in
that the signal parameters (V) in the statistical model (SM) are independent
of each other, given the location (X).

8. A method according to any one of the preceding claims, cha- 
racterized by reducing uncertainty concerning the receiver's location on the 
basis of a history (OH) of the observed signal parameters. 

9. A method according to any of the preceding claims, characterized by
modelling at least some of the signal parameters (V) by discrete variables
whose values correspond to intervals or unions of intervals on the range of
possible signal parameter values.

10. A method according to any of the preceding claims, charac- 
terized by modelling the location (X) as a discrete variable. 

11. A location estimating apparatus (LEM) for estimating a location (X) of a
receiver (R, R') in a wireless telecommunication environment (RN), the
telecommunication environment comprising a plurality of channels for
simultaneous communication, each channel having at least one signal parameter
(V) that varies with location (X) differently from the other channels;

characterized by:

a model construction module (MCM) for:

- receiving a set of calibration data (CD) for each of a plurality of
calibration points in the wireless telecommunication environment, each set of
calibration data comprising the location (X) of the respective calibration
point and at least one measured signal parameter (V) for each of several
channels at that calibration point; and

- maintaining, on the basis of the sets of calibration data (CD), a
statistical model (SM) of the signal parameters (V) of the several channels
versus a receiver's location in the wireless telecommunication environment
(RN);

and a location calculation module (LCM) for:

- receiving a set of observed signal parameters (CO), the set comprising at
least one observed signal parameter (V) for each of several channels at the
location (X) of the receiver (R, R'); and

- determining a location estimate (LE) approximating the location (X) of the
receiver (R, R') on the basis of the statistical model (SM) and the set of
observed signal parameters (CO).

12. A receiver (R, R') comprising means for determining sets of observed
signal parameters (CO), each set comprising at least one observed signal
parameter (V) for each of several channels at the location (X) of the receiver
(R), characterized by means for conveying the sets of observed signal
parameters (CO) to a location calculation module (LCM) for determining a
location estimate (LE) approximating the location (X) of the receiver (R) on
the basis of said sets and a statistical model (SM) of the signal parameters
(V) of the several channels versus a receiver's location in a wireless
telecommunication environment (RN).

13. A receiver (R') according to claim 12, characterized by comprising the
location calculation module (LCM).

14. A receiver (R) according to claim 12, characterized in that the means for
conveying the sets of observed signal parameters comprises means (RI) for
conveying the sets to an external location calculation module (LCM).

15. A receiver according to any one of claims 12 to 14, characterized in that
at least some of the sets of observed signal parameters (CO) relate to
networks the receiver is not attached to.